Experiments in Binning Image Statistics

نویسنده

  • Nimar S. Arora
چکیده

Various vision tasks require the computation of image statistics and aggregating them into histograms. These histograms are usually compared using the χ distance which gives a rough idea of the similarity of the two image patches. For example, in [2] a histogram of key-points is collected on a rectangular grid for the category recognition task. Other researchers, [3], have used local image statistics around a point to find corresponding points. In general, image statistics produce a vast array of number of varying magnitudes. Some form of aggregation is required to ensure robustness. Although k-means clustering is a commonly used device in this setting, the Euclidean distance on which it is based is not always the appropriate measure of distance. A pre-processing step where the statistics are aggregated into histograms is quite helpful. In this study, we study the issues surrounding binning these statistics by feature location and feature magnitude. Where, a feature can be interpreted as the response of an arbitrary filter or key-point detection algorithm. We claim that the optimal binning strategy depends on the particular feature being computed. Towards this end, we try various strategies on a fixed set of features and evaluate their performance on a specific task. The task that we consider is object categorization using the CalTech 101 dataset [1]. The overall approach is to use a nearest neighbour classifier using the χ distance between the feature histograms of the test image and the training images. In the subsequent sections we describe the features that we collected and the performance of various binning strategies that we evaluated on these features.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image mapping using local and global statistics

We describe a set of techniques for mapping one image to another based on the statistics of a training set. We apply these techniques to the problems of image denoising and superresolution, but they should also be useful for many vision problems where training data are available. Given a local feature vector computed from an input image patch, we learn to estimate a subband coefficient of the o...

متن کامل

Compression of Remote Sensing Images for the PROBA-V Satellite Mission

We investigate compression of remote sensing images with a special geometry of non-square pixels. Two fundamentally different data reduction strategies are compared: a combination of pixel binning with near lossless compression and a method operating at higher compression rates. To measure the real impact of the compression, the image processing flow upto final products is included in the exper...

متن کامل

Detection- and Information-theoretic Analysis of Steganography and Fingerprinting

The proliferation of multimedia and the advent of the Internet and other public networks have created many new applications of information hiding in multimedia security and forensics. This dissertation focuses on two of these application scenarios: steganography (and its counter problem, steganalysis), and fingerprinting. First, from a detection-theoretic perspective, we quantify the detectabil...

متن کامل

Using visible SNR (vSNR) to compare the image quality of pixel binning and digital resizing

We introduce a new metric, the visible signal-to-noise ratio (vSNR), to analyze how pixel-binning and resizing methods influence noise visibility in uniform areas of an image. The vSNR is the inverse of the standard deviation of the SCIELAB representation of a uniform field; its units are 1/ΔE. The vSNR metric can be used in simulations to predict how imaging system components affect noise visi...

متن کامل

Using visible SNR (vSNR) to compare image quality of pixel binning and digital resizing

We introduce a new metric, the visible signal-to-noise ratio (vSNR), to analyze how pixel-binning and resizing methods influence noise visibility in uniform areas of an image. The vSNR is the inverse of the standard deviation of the SCIELAB representation of a uniform field; its units are 1/ΔE. The vSNR metric can be used in simulations to predict how imaging system components affect noise visi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007